Using Multiple Imputation Technique to Correct for Measurement Error and Statistical Disclosure Control in Sensitive Count Data in a National Survey

Author

  • Mandi Yu
Abstract

Measurement error in sensitive questions is pervasive and biases the estimates of most statistical models. The objective of this paper is to correct for measurement error in the number of lifetime sexual partners by treating it as a missing data problem and using multiple imputation to synthesize the underlying “true” attribute. A Bayesian Poisson model with diffuse Gaussian priors was fitted to the 1996 General Social Survey, incorporating knowledge of data quality from the mode experiment conducted by Tourangeau and Smith (1996). The threat of augmented disclosure harm from releasing both imputed and original data to the public, which has been ignored in the existing literature, was recognized and addressed by statistical perturbation. Bias reduction and statistical integrity were evaluated. The Markov chain Monte Carlo algorithm was programmed in WinBUGS.
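
As a rough illustration of the general approach named in the abstract (a Bayesian Poisson model with diffuse Gaussian priors, sampled by MCMC and used to generate multiple imputations), a minimal sketch follows. It is not the paper's WinBUGS model and does not include the measurement-error correction from the mode experiment or the disclosure perturbation. The toy data, the single binary covariate, the prior standard deviation of 100, the random-walk Metropolis sampler, and all function names are assumptions made only for this example.

```python
import math
import random

def log_posterior(beta, xs, ys, prior_sd=100.0):
    """Unnormalized log posterior of a Poisson log-linear model with
    independent N(0, prior_sd^2) priors (diffuse for large prior_sd)."""
    b0, b1 = beta
    lp = -(b0 * b0 + b1 * b1) / (2.0 * prior_sd ** 2)
    for x, y in zip(xs, ys):
        eta = b0 + b1 * x
        lp += y * eta - math.exp(eta)  # Poisson log-likelihood without log(y!)
    return lp

def metropolis(xs, ys, n_iter=6000, step=0.1, seed=0):
    """Random-walk Metropolis sampler; returns draws after burn-in."""
    rng = random.Random(seed)
    beta = (0.0, 0.0)
    current = log_posterior(beta, xs, ys)
    draws = []
    for _ in range(n_iter):
        proposal = (beta[0] + rng.gauss(0.0, step), beta[1] + rng.gauss(0.0, step))
        candidate = log_posterior(proposal, xs, ys)
        # Accept with probability min(1, posterior ratio).
        if rng.random() < math.exp(min(0.0, candidate - current)):
            beta, current = proposal, candidate
        draws.append(beta)
    return draws[n_iter // 2:]  # discard the first half as burn-in

def poisson_draw(lam, rng):
    """Draw one Poisson(lam) count with Knuth's method (suitable for small lam)."""
    threshold = math.exp(-lam)
    k, p = 0, 1.0
    while True:
        p *= rng.random()
        if p <= threshold:
            return k
        k += 1

def impute(xs, draws, m=5, seed=1):
    """Create m imputed count vectors, each from a different posterior draw."""
    rng = random.Random(seed)
    datasets = []
    for _ in range(m):
        b0, b1 = rng.choice(draws)
        datasets.append([poisson_draw(math.exp(b0 + b1 * x), rng) for x in xs])
    return datasets

# Toy data (hypothetical): x is a binary covariate, y is a reported count.
xs = [0, 0, 0, 0, 1, 1, 1, 1] * 5
ys = [2, 3, 1, 2, 5, 4, 6, 5] * 5

draws = metropolis(xs, ys)
datasets = impute(xs, draws, m=5)
posterior_mean_b1 = sum(b1 for _, b1 in draws) / len(draws)
print("posterior mean of slope:", round(posterior_mean_b1, 2))
print("first imputed data set:", datasets[0][:8])
```

Drawing each imputed data set from a different posterior draw of the coefficients, rather than from a single point estimate, is what lets the imputations carry parameter uncertainty as well as sampling noise.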

Similar resources

Nonresponse prediction in an establishment survey using combination of statistical learning methods

Nonresponse is a source of error in survey results, and national statistical organizations are always looking for ways to control and reduce it. Predicting nonresponding sampling units before conducting the survey is one of the solutions that can help a lot in reducing and treating survey nonresponse. Recent advances in technology and the facilitation of complex calculations...

Chapter 8 Multiple Imputation and Disclosure Protection: The Case of the 1995 Survey of Consumer Finances Arthur

Donald Rubin has suggested many times that one might multiply impute all the data in a survey as a means of avoiding disclosure problems in public-use datasets. Disclosure protection in the Survey of Consumer Finances is a key issue driven by two forces. First, there are legal requirements stemming from the use of tax data in the sample design. Second, there is an ethical responsibility to protec...

Multiple imputation: an alternative to top coding for statistical disclosure control

Top coding of extreme values of variables like income is a common method of statistical disclosure control, but it creates problems for the data analyst. The paper proposes two alternative methods to top coding for statistical disclosure control that are based on multiple imputation. We show in simulation studies that the multiple-imputation methods provide better inferences of the publicly rel...

Feasibility of using statistical tests in evaluation of non-uniformity [Persian]

Introduction: The non-uniformity test is essentially the only required daily QC procedure in nuclear medicine practice. Noise creates statistical variation or random error in a flood image. Non-uniformity, on the other hand, does not have a statistical nature and may be regarded as systemic error. The present methods of non-uniformity calculation do not distinguish between these two types of erro...

Combining synthetic data with subsampling to create public use microdata files for large scale surveys

To create public use files from large scale surveys, statistical agencies sometimes release random subsamples of the original records. Random subsampling reduces file sizes for secondary data analysts and reduces risks of unintended disclosures of survey participants’ confidential information. However, subsampling does not eliminate risks, so that alteration of the data is needed before dissemi...

Publication date: 2007